Principles of Data Integration

ثبت نشده
چکیده

The World Wide Web offers a vast array of data in many forms. The majority of this data is structured for presentation not to machines but to humans, in the form of HTML tables, lists, and forms-based search interfaces. These extremely heterogeneous sources were created by individuals around the world and cover a very broad collection of topics in over 100 languages. Building systems that offer data integration services on this vast collection of data requires many of the techniques described thus far in the book, but also raises its own unique challenges. While the Web offers many kinds of structured content, including XML (discussed in Chapter 11) and RDF (discussed in Chapter 12), the predominant representation by far is HTML. Structured data appears on HTML pages in several forms. Figure 15.1 shows the most common forms: HTML tables, HTML lists, and formatted “cards” or templates. Chapter 9 discusses how one might extract content from a given HTML page. However, there are a number of additional challenges posed by Web data integration, beyond the task of wrapping pages.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Design of Oil Refineries Hydrogen Network Using Process Integration Principles

This paper describes the application of process integration principles to the design of oil refineries hydrogen network. In this regard, a design hierarchy as well as heuristics and required guidelines are proposed. The recommended rules compensate lack of procedure to the design and make the design process easier. The guiding principles of the design are based upon pinch technology and ext...

متن کامل

GPS/INS Integration for Vehicle Navigation based on INS Error Analysis in Kalman Filtering

The Global Positioning System (GPS) and an Inertial Navigation System (INS) are two basic navigation systems. Due to their complementary characters in many aspects, a GPS/INS integrated navigation system has been a hot research topic in the recent decade. The Micro Electrical Mechanical Sensors (MEMS) successfully solved the problems of price, size and weight with the traditional INS. Therefore...

متن کامل

Clinical Governance in Primary Care Principles, Prerequisites and Barriers: A Systematic Review

Introduction: Primary care organizations are the entities through which clinical governance is developed at local level. To implement clinical governance in primary care, awareness about principles, prerequisites and barriers of this quality improvement paradigm is necessary. The aim of this study is to pool evidence about implementing clinical governance in primary care organizations. Data so...

متن کامل

Optimization of Control System of Petroleum Refinery Isomerization Unit by Plant-Wide Control Principles

Industrial process control system due to integration of equipment, existing material and energy recycle streams and their effects are extraordinary importance. The process in terms of safety, product quality and control stability have a challenge when there was defects in a process control system. The isomerization process is gaining importance in the presence of refining context due to upgradi...

متن کامل

Structured Network Public Spaces a Step Toward Integration of Urban

Network of public spaces composes of a network of interconnected land use and various elements of the city, such as synthetic and natural which shows the city as a whole. Network structure of public spaces is important because understanding this network as a structure presents us the formation of the city. This paper attempts to define the status of the network of public spaces in the city stru...

متن کامل

بررسی هستان شناسی های توسعه یافته مبتنی بر اصول هستان شناسی های منبع باز زیست پزشکی

Background and Aim: Ontologies facilitate data integration, exchange, searching and querying. Open Biomedical Ontologies (OBO) Foundry is a solution for creating reference ontologies. In this foundry, the design of ontologies is based on established principles which allow for their interactions as a single system. The purpose of this study is to determine the main features of ontologies develop...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012